Overcoming Unknown Words
Authors
Abstract
Similar resources
Pointing the Unknown Words
The problem of rare and unknown words is an important issue that can potentially affect the performance of many NLP systems, including both traditional count-based and deep learning models. We propose a novel way to deal with rare and unseen words in neural network models using attention. Our model uses two softmax layers in order to predict the next word in conditional language...
Processing Unknown Words in HPSG
The lexical acquisition system presented in this paper incrementally updates linguistic properties of unknown words inferred from their surrounding context by parsing sentences with an HPSG grammar for German. We employ a gradual, information-based concept of “unknownness” providing a uniform treatment for the range of completely known to maximally unknown lexical entries. “Unknown” information ...
Understanding of unknown medical words
We assume that unknown words with internal structure (affixed words or compounds) can provide speakers with linguistic cues as to their meaning, and thus help their decoding and understanding. To verify this hypothesis, we propose to work with a set of French medical words. These words are annotated by five annotators. Then, two kinds of analysis are performed: analysis of the evolution of und...
Syntactic Processing of Unknown Words
A method is presented for processing sentences that contain unknown words, i.e., words for which no lexical entry exists. There are three different stages of processing: 1. The sentence with the unknown word is parsed. There are no special requirements for the parsing algorithm, but the lexical lookup procedure needs to be modified. 2. Based on the syntactic structure of the parse, informatio...
Handling Unknown Words in Arabic FST Morphology
A morphological analyser only recognizes words that it already knows in the lexical database. It needs, however, a way of sensing significant changes in the language in the form of newly borrowed or coined words with high frequency. We develop a finite-state morphological guesser in a pipelined methodology for extracting unknown words, lemmatizing them, and giving them a priority weight for inc...
Journal
Journal title: Journal of Educational & Psychological Sciences
Year: 2005
ISSN: 1726-5231
DOI: 10.12785/jeps/060109